85 research outputs found
Salient movies
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1995.Includes bibliographical references (leaves 64-65).by Karrie Karahalios.M.Eng
Enhancing Child Vocalization Classification in Multi-Channel Child-Adult Conversations Through Wav2vec2 Children ASR Features
Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder that often
emerges in early childhood. ASD assessment typically involves an observation
protocol including note-taking and ratings of child's social behavior conducted
by a trained clinician. A robust machine learning (ML) model that is capable of
labeling adult and child audio has the potential to save significant time and
labor in manual coding children's behaviors. This may assist clinicians capture
events of interest, better communicate events with parents, and educate new
clinicians. In this study, we leverage the self-supervised learning model,
Wav2Vec 2.0 (W2V2), pretrained on 4300h of home recordings of children under 5
years old, to build a unified system that performs both speaker diarization
(SD) and vocalization classification (VC) tasks. We apply this system to
two-channel audio recordings of brief 3-5 minute clinician-child interactions
using the Rapid-ABC corpus. We propose a novel technique by introducing
auxiliary features extracted from W2V2-based automatic speech recognition (ASR)
system for children under 4 years old to improve children's VC task. We test
our proposed method of improving children's VC task on two corpora (Rapid-ABC
and BabbleCor) and observe consistent improvements. Furthermore, we reach, or
perhaps outperform, the state-of-the-art performance of BabbleCor.Comment: Submitted to ICASSP 202
You can't always sketch what you want: Understanding Sensemaking in Visual Query Systems
Visual query systems (VQSs) empower users to interactively search for line
charts with desired visual patterns, typically specified using intuitive
sketch-based interfaces. Despite decades of past work on VQSs, these efforts
have not translated to adoption in practice, possibly because VQSs are largely
evaluated in unrealistic lab-based settings. To remedy this gap in adoption, we
collaborated with experts from three diverse domains---astronomy, genetics, and
material science---via a year-long user-centered design process to develop a
VQS that supports their workflow and analytical needs, and evaluate how VQSs
can be used in practice. Our study results reveal that ad-hoc sketch-only
querying is not as commonly used as prior work suggests, since analysts are
often unable to precisely express their patterns of interest. In addition, we
characterize three essential sensemaking processes supported by our enhanced
VQS. We discover that participants employ all three processes, but in different
proportions, depending on the analytical needs in each domain. Our findings
suggest that all three sensemaking processes must be integrated in order to
make future VQSs useful for a wide range of analytical inquiries.Comment: Accepted for presentation at IEEE VAST 2019, to be held October 20-25
in Vancouver, Canada. Paper will also be published in a special issue of IEEE
Transactions on Visualization and Computer Graphics (TVCG) IEEE VIS
(InfoVis/VAST/SciVis) 2019 ACM 2012 CCS - Human-centered computing,
Visualization, Visualization design and evaluation method
Characterizing Scalability Issues in Spreadsheet Software using Online Forums
In traditional usability studies, researchers talk to users of tools to
understand their needs and challenges. Insights gained via such interviews
offer context, detail, and background. Due to costs in time and money, we are
beginning to see a new form of tool interrogation that prioritizes scale, cost,
and breadth by utilizing existing data from online forums. In this case study,
we set out to apply this method of using online forum data to a specific
issue---challenges that users face with Excel spreadsheets. Spreadsheets are a
versatile and powerful processing tool if used properly. However, with
versatility and power come errors, from both users and the software, which make
using spreadsheets less effective. By scraping posts from the website Reddit,
we collected a dataset of questions and complaints about Excel. Specifically,
we explored and characterized the issues users were facing with spreadsheet
software in general, and in particular, as resulting from a large amount of
data in their spreadsheets. We discuss the implications of our findings on the
design of next-generation spreadsheet software
Facilitating multisyllabic productions & assessing sympathetic arousal in children with developmental disorders
Speech-language impairments represent one of the most common developmental disorders, ranging from 1.3-14.3%. In particular the ability to combine syllables represents an important developmental milestone that is delayed or impaired in a variety of clinically-identified populations. However, evidence to support specific treatment practices in this area is relatively sparse. In addition, limited information is available regarding how children's sympathetic arousal is associated with interventions. Recent technological advances in electrodermal activity (EDA) interfaces, as seen in the Q sensor (Affectiva, 2012), provide the opportunity to conduct in situ EDA assessments. EDA is sensitive to both cognitive and emotional states and processes, thereby offering the potential to derive information regarding children’s internal states during intervention. The present study 1) examined the effectiveness of an integrated speech-language intervention in increasing children's multisyllabic productions, 2) assessed the associations between in situ EDA and off-line behavioral coding of emotional valence, and 3) examined the association among different EDA measures.Ope
- …